Evidence of automatic processing in sequence learning using process-dissociation

This paper proposes a way to apply process-dissociation to sequence learning in addition and extension to the approach used by Destrebecqz and Cleeremans (2001). Participants were trained on two sequences separated from each other by a short break. Following training, participants self-reported their knowledge of the sequences. A recognition test was then performed which required discrimination of two trained sequences, either under the instructions to call any sequence encountered in the experiment “old” (the inclusion condition), or only sequence fragments from one half of the experiment “old” (the exclusion condition). The recognition test elicited automatic and controlled process estimates using the process dissociation procedure, and suggested both processes were involved. Examining the underlying processes supporting performance may provide more information on the fundamental aspects of the implicit and explicit constructs than has been attainable through awareness testing.

Tests that attempt to ascertain the level of explicit knowledge have common issues (see Shanks & St. John, 1994), such as potential insensitivity to participants' explicit knowledge, and face the fundamental challenge that there would be no way to rule out the possibility that some level of awareness exists (Merikle, 1994). A further concern is that explicit knowledge tests are contaminated by implicit knowledge, and vice versa (Neal & Hesketh, 1997).

Issues in awareness testing
One of the problems with distinguishing between implicit and explicit performance is the very basis on which that distinction can be drawn.
There are several ways to assess a participant's awareness of what has been learned within an implicit learning paradigm. Some awareness testing methods are more readily amenable to specific implicit learning tasks than others, and different tests do have issues that impact how meaningful the results are. One straightforward awareness test is the self-report of awareness of the repeating information presented during the experiment (see Frensch & Runger, 2003;Seth, Dienes, Cleeremans, Overgaard, & Pessoa, 2008;Shanks, 2005). An assumption underlying use of this test is that explicit knowledge can be verbalized, such that a lack of verbalizable knowledge implies implicit learning. One key problem is that a participant may have explicit knowledge, and this information may even be readily available to conscious thought, but the nature of the test may prevent them from describing or conveying this knowledge (Shanks & St. John, 1994).
As discussed by Cheesman and Merikle (1984), self-report tests likely only tap participants' knowledge of what they know they know (i.e., metaknowledge). They argued there are two thresholds important for non-conscious knowledge: a subjective threshold below which participants feel they do not have knowledge, and an objective threshold wherein participants do not feel they have knowledge in addition to not displaying it. Therefore, asking participants to freely verbalize what they know requires the knowledge to be above the subjective threshold. Dienes and Berry (1997) expanded this idea to implicit learning tasks, including SRTT. By definition, whatever knowledge participants have exceeds the objective threshold, be it implicit or explicit knowledge, and knowledge that participants do not have, is below the objective threshold. Implicit knowledge would be above the objective, but below the subjective threshold. Finally, explicit knowledge would be above both thresholds. They suggested that knowledge acquired through SRTT training is above the objective threshold as participants reliably speed up to the trained sequence. Furthermore, there is evidence that for many participants the acquired knowledge is below the subjective threshold, since conscious knowledge of what was learned is not necessary for the faster reaction times. Self-report tests should tap knowledge available above the subjective threshold, whereas cued tests (i.e., recognition and generation tests) should assess knowledge above the objective threshold, regardless of its relation to the subjective threshold. The distinction between cued and free report tests is included in this framework as cued tests should provide a more thorough measure of the knowledge above the objective threshold. Distinguishing whether knowledge is above (i.e., explicit) or below (i.e., implicit) the subjective threshold is naturally much more difficult than determining its relation to the objective threshold. Furthermore, all knowledge above the objective threshold could be employed during cued tests.
Consequently, in this framework the results of the cued tests could be thought to provide little distinguishing information on the nature of the underlying knowledge without further manipulation.
One critical point in assessing conscious awareness is that implicit knowledge could be contaminating the explicit knowledge measurements in that the implicit knowledge would inadvertently be used during the explicit measurement (e.g., Shanks & Johnstone, 1999). This is a valid concern that the aforementioned awareness tests are ill-equipped to handle.

Process-dissociation procedure
If the previously discussed tests necessarily contain contributions from multiple knowledge sources, then a different approach must be adopted. Reingold and Merikle (1988) proposed that, to circumvent needing pure measures of conscious or non-conscious knowledge, the relative sensitivity of two measures could be compared. One measure is a direct test of knowledge, in which participants are instructed to make a discrimination (e.g., was an object seen or not). The other measure is an indirect test of knowledge, wherein the discrimination is not part of the instructional set. The necessary assumption for this comparison to be valid is that the direct test is more sensitive to conscious knowledge than the indirect test. Therefore, non-conscious knowledge is assumed to be present when discrimination accuracy is higher or reaction times are quicker for the indirect test than the direct test. This concept, origi- The rationale behind PDP for decoupling controlled and automatic processing is to create a test condition for which both controlled and automatic processes can lead to a correct response (the inclusion condition), and a test condition in which a failure of controlled processing leads to errors, presumably reflecting automatic processing when controlled processing fails (the exclusion condition). PDP has a long history in implicit memory research (cf. Yonelinas, 2002), and as such has had many refinements to the technique. Therefore, many of the issues have been explored in the literature leading to a stable tool in dissociating controlled from automatic processes.
Similarities between both implicit memory and implicit learning (Buchner, 1994;Buchner & Wippich, 1998) suggest that implicit learning research would benefit from incorporating a dual-process model and approach (Jacoby, Kelley, & Dywan, 1989;Yonelinas, 2002). Thus, using the PDP approach is a natural step in understanding what is happening during implicit learning. Although the PDP approach has been criticized by some researchers (Curran & Hintzman, 1995;Dodson & Johnson, 1996;Graf & Komatsu, 1994), the procedure has proven to be a useful tool across many disciplines within psychology.
The relation of controlled (C) and automatic (A) processing on the inclusion test is as follows: By contrast, performance on the exclusion task is accomplished primarily by recollecting items from the study context, which requires controlled processing. In the present study, this involves recollecting which half of training the sequence came from. Thus, errors on the exclusion test reflect A in the absence of C, which is captured in the following equation: That is, exclusion performance equals the probability that A influences responding given that C failed to influence responding. In practice, calculating the C component of a task is achieved by simply subtracting the false alarm rate for the exclusion condition, which is based on A in the absence of C, from the hit rate for the inclusion condition, which is based on both C and A: A can then be calculated by dividing false alarms on the exclusion task (automaticity in the absence of control), by the inverse of the controlled process estimate, as shown by solving Equation 2 for A: Calculating the process estimates avoids the assumption that implicit and explicit processes are used in isolation from each other, and also considers the underlying processing in implicit learning tasks rather than focusing solely on determining the awareness of the participant. The idea that implicit learning might involve automatic processes is not new (cf. e.g., J. Cohen & Poldrack, 2008;Frensch, 1998;Hayes & Broadbent, 1988;Jimenez & Mendez, 1999;Soetens, Melis, & Notebaert, 2004). Furthermore, this is not proposing to solve the debate over how many learning systems are necessary (e.g., Frensch & Runger, 2003;Shanks, 2005;Shanks & St. John, 1994), but instead measuring one part of the whole learning system and its effect on performance.
In the present study, different sequences were employed in the two distinct halves of the training. The inclusion test instructions for a subsequent recognition test asked participants to respond "old" if the presented sequence was from either half of the training phase, and thus, knowledge of either sequence would lead to a correct response; this could result from the influence of either controlled or automatic processing. For the exclusion task instructions, participants responded "old" only if the sequence was from one half of the training phase.
Thus, accurate responding in the exclusion condition requires that participants can identify the sequence as having been presented during training, and additionally, requires a temporal discrimination (e.g., Was it from the second half of training?). Consequently, to the extent that controlled processing fails, but automatic processing influences performance, participants will make errors on the exclusion test.
Similar to the memory experiments (Jacoby, 1991;Nairne & Kelley, 2004), the current experiment required participants to discriminate which half of the experiment an item (i.e., sequence) appeared in for the exclusion directions. If participants have control over the sequence knowledge they were trained on, they should be able to successfully make this discrimination. If participants do not have adequate control over this knowledge, then they will be unable to exclude the directed trained sequence as they will be relying on the general familiarity for both sequences in responding. This will lead to a higher automatic processing estimate, and a low or non-existent controlled processing estimate. 1 For a participant to have a controlled process estimate in this experiment, the participant needs to have access to both the two sequences that were trained, but also the half each sequence was trained.
Under the exclusion directions, participants will need the correct temporal cue (i.e., experiment half) associated with the to-be-included or -excluded sequence to make the discrimination. If they are only able to recognize that the sequence had been experienced before, but not at which half, their control over the knowledge for that sequence is incomplete (Shiffrin & Schneider, 1977). In this case, their judgment will be based on the automatic processes supporting familiarity.
There have been concerns about the possibility that participants could have awareness of the learned information, but not have access to when it was learned. One valuable perspective on this issue comes from Yonelinas and Jacoby (1994). This "partial recollection" of the learned information should result in the automatic estimates looking like the controlled. This is because the partial recollections would result in those items being treated as familiar rather than actually recollected, thus bleeding the partially recollected items into the automatic process estimate. It was concluded that this must be infrequent compared to the full recollection rates, and was not a great concern. 2 In the acquisition phase of implicit learning experiments, participants are usually only exposed to one repeating sequence (Nissen & Bullemer, 1987) or to one set of repeating information (e.g., the repeating visual search arrays of contextual cuing, Chun & Jiang, 1998 It has been proposed that the SRTT, and implicit learning research in general, can benefit from shifting to thinking of the underlying processes for non-intentional learning (e.g., Cleeremans, 1997;Jimenez & Mendez, 2001;Perlman & Tzelgov, 2006. One conceptualization of an automatic process assumes that the process is capable of occurring without conscious control and without intention (Bargh, 1989(Bargh, , 1992; but see Shanks & Johnstone, 1999). However, all participants will employ both automatic and controlled processes during learning and at test (Buchner & Wippich, 1998). Thus, this study will determine if the standard version of PDP can be implemented within the SRTT paradigm, and based on that investigate the relative influence these two processes have on the task.

Current experiment
This experiment expanded on the research by Destrebecqz and Cleeremans (2001) in deriving process estimates for trained sequences.  (Rajaram & Roediger, 1993). Furthermore, both controlled and automatic processes are expected to be used at the ending test as these are posited to support most, if not all, decisions (Jacoby, 1991;Jacoby, Lindsay, & Toth, 1992;Jacoby, Toth, & Yonelinas, 1993).
While there will likely be differences in process estimates between participants, the question remains whether predominant automatic processing can drive enhanced performance with the SRTT. If automatic processes are sufficient for SRTT, participants who did not show evidence of significant controlled processing at test should still have the speeded serial reaction times after acquisition of the trained sequence.
Finally, it was expected that overall there would be both controlled and automatic processing estimates; that is, both processes would support the speeded responses on trained sequences.

Participants and materials
Forty-six Colorado State University students participated in exchange for partial course credit. All participants had normal or corrected-tonormal vision.
All stimuli were shown and data were collected in E-Prime (Schneider, Eschman, & Zuccolotto, 2002). The SRTT display consisted of a white background with four black square outlines evenly spaced in a row along the center of the display. These four black square outlines essentially served as the place holders for where the target could appear on any trial. Each square was 5 cm along a side, the black outline was 2 mm thick, and the squares all were filled with white. The target was 3.8 cm in diameter, and colored green. It would appear in the center of the appropriate square for each trial.
In total, four ambiguous (A. Cohen, Ivry, & Keele, 1990), second order conditional sequences (Reed & Johnson, 1994) were used in this experiment. All sequences had equivalent response frequencies.
The two training sequences were presented to all participants, but in a counterbalanced order. Participants were trained on the sequences shown in the training column of Table 1. Learning of the practiced sequences was assessed through the introduction of new sequences at the eighth block during each training half, known as the transfer sequences.
These sequences were chosen as they had minimal overlap in similar three and four item response chunks. Importantly, the two training sequences had no three or four overlapping item response chunks. were instructed to respond as quickly and as accurately as they could.

Procedure and measures
Each trial started when the target appeared in one of the four possible locations and ended when participants made the correct response.
After the correct response was made, the target appeared with no delay at the next location. Participants were given four practice trials before starting the experiment.
There were two phases of training blocks, each phase featuring practice with a different 12 item sequence as described above. Each phase had nine blocks of 100 trials each. The first four trials of each block were pseudo-randomly generated and not part of the sequence.
Therefore there were 900 trials per half, of those 864 being sequenced trials. There was a mandatory 10 s break between each block, after which participants were free to rest further if they chose. The eighth block switched participants to a non-trained sequence, which was different for the two sets. The ninth block returned participants to the repeated sequence for that section of the experiment. Thus, participants responded to 1,800 stimuli between the two lists, 1,728 of which followed one of the two practiced sequences. In between the two sets, there was a 3 min distracter task.
After participants completed both training sets, their knowledge of the sequences was assessed using two measures. The first was a simple self-report, in which participants were asked, "Did you notice anything

Training performance
The reaction times from training were submitted to a 2 (train- sequences Participants Were trained on, transferred to during training, and exposed to during the recognition test.
Note. The leftmost response position is designated by 1, and the rightmost is designated by 4.

Self-report
Participants were asked at the end of training if they noticed anything Cohen's d = 1.05. This finding is congruent with previous suggestions that those participants whose self-reports reflect little awareness of any repeating information in the experiment were nonetheless able to learn the sequences (although as discussed previously, this does not necessarily translate to evidence of purely implicit learning). A first conclusion that can be drawn is that awareness is not necessary for sequence learning, which replicates previous sequence learning findings for the necessity of awareness (e.g., Clegg et al., 1998).

Recognition test
For the recognition test, participants had been asked to respond to a series of nine locations, as they had during training, but then indicated if this set was "old" or "new" according to the type of instructions. Under inclusion instructions, participants were to call a sequence fragment "old" if they had encountered it during the experiment at any point.
The exclusion instructions were to call a sequence fragment "old" only if it was from the second half of the experiment. The likelihood participants called each item type "old" was calculated under both forms of instruction, and is shown in Table 2. In order to calculate PDP estimates, response bias should be roughly equivalent for the inclusion and exclusion tests (Jacoby, 1998) 4 . Response bias was examined by comparing the false alarm rate to the new random sequences, which did not differ for the inclusion and exclusion tests, t(45) = 2.0, p > .05.
There was also not a difference between counterbalance conditions in recognition responses, F(3, 42) = 0.16, p > .05, η p 2 = .01, and as such will not be discussed further.
As per the PDP, the controlled and automatic processing estimates were calculated using the formulas presented in the Introduction for the to-be-excluded sequence. These rates can only be calculated for the to-be-excluded sequence because this is the only sequence in which there can be failures of controlled processes allowing the trained automatic processes to influence responses. The process estimates were submitted to separate t-tests to examine whether they were greater   Table 3. A second conclusion from this experiment is that having control over the sequence knowledge gained from training is not necessary to receive the reaction time benefit.
To test for a difference between self-report aware and non-aware participants in their ability to control the acquired knowledge, the controlled process estimates were compared between self-reported aware controlled and automatic process estimates for the recognition test. error bars represent one standard error.  controlled processing. This could also be due to an underestimation of the "true" controlled processing occurring in participants, but this would need to be verified by tests with greater sensitivity to processing.
Correlations were run between the transfer cost during training, the self-report measure of awareness, and controlled and automatic process estimates to test for further relations between the different factors. As shown in Table 4, the only significant correlation was between the training transfer cost and self-reported awareness. This small correlation suggests that participants that reported awareness tended to have greater transfer costs. To probe this issue in a more meaningful way, future experiments should use self-report measures that allow for a greater continuum of awareness states. There were no significant correlations between the self-report measure of awareness and the process estimates.

dIscussIon
The current study replicated the well-established finding that participants learn a repeated sequence of items within the SRTT paradigm, with shorter latencies in the trained sequences than in the novel sequences encountered towards the end of each training segment. The process estimate data provide evidence consistent with automatic processes operating. However, while the results support a role for automatic processing, the recognition test also shows an influence of controlled processing that highlights the presence of both types of processes in this task.
Implementing the PDP in sequence learning has allowed for a new and different way of examining the behavioral consequences of training participants on repeating sequences. The ability to get through 1,600 trials in 1 hr makes it possible to have participants make a temporal comparison of when they had encountered the probed sequences at test, thus allowing the conditions necessary to meet the PDP assumptions. By having two trained sequences in addition to the new and random test sequences, participants could make exclusion errors thought to be driven by automatic processes (Jacoby, 1991(Jacoby, , 1998.
Importantly, the trained sequences were not so highly learned or sufficiently distinctive that participants showed perfect recognition. In situations in which participants would be able to recognize all the learned elements as "old", an absence of necessary errors would then render the PDP inappropriate to use.
The current experiment featured the presence of two learned sequences that needed to be discriminated from each other. Importantly in the context of sequence learning, this rules out the possibility that participants can use perceptual and motor fluency during the recognition test to influence their classification of sequences (see Perruchet & Amorim, 1992). While more fluid execution of a sequence might provide opportunities to distinguish an old sequence from a new one, regardless of whether controlled processes were operating, it would not provide a basis to determine when within the experiment the sequence had been practiced. Previous sequence learning research using the PDP has sought ways to circumvent this issue, for example, through the use of generation of sequences rather than responses to them (Destrebecqz & Cleeremans, 2001), or the addition of further measures (Buchner et al., 1997). However, the design employed here requires minimal variation from the originally developed SRTT method, and hence offers some advantages over previous instantiations of the PDP within sequence learning. For instance, the inclusion of the temporal discrimination at test allows for responses to be made on more than just motor fluency as greater knowledge of when the sequence had been encountered is required.

The role of awareness
One advantage in investigating implicit learning from an information processing approach is that while knowledge is hard to satisfactorily and exhaustively measure, the underlying processes are more readily testable. In addition to having measures that are more objective than self-report tests, shifting to a processing view of implicit learning also allows for more direct explanations of how the performance changes during and after training. Instead of inferring how the implicit and explicit knowledge types are thought to influence performance, the PDP assesses automatic and controlled processes that support performance.
The conditions for how these processes develop and when they tend to be employed can be further investigated using this technique.
Some theoretical accounts of sequence learning (e.g., Keele et al., 2003) already somewhat marginalize the issue of awareness. Awareness may not be a necessary characteristic of any of the processes or systems involved in sequence learning (see also Clegg et al., 1998). We believe conceptualizing performance within implicit learning tasks in terms of the underlying processes, and in particular automatic processing as identified through the PDP approach, provides a means to move beyond debates about the awareness or implicit versus explicit knowledge. Moreover, the mere ability to impose controlled processing within a task need not indicate that the relevant knowledge for task performance is "explicit". Examples from motor skill performance show that automatic processes can even be disrupted, and perfor-mance degraded, if superseded by conscious monitoring (e.g., Beilock & Carr, 2001).
Further, the fact that participants who lacked controlled processes at test still had a reaction time benefit during training indicates that major use of controlled processes over what has been learned is not necessary. This is in line with definitions of implicit learning that specify the learning can occur without awareness or intent (e.g., Frensch, 1998;Seger, 1994). However, it is possible that even the PDP used in this experiment was insensitive to what controlled knowledge those participants had (Shanks & St. John, 1994). But given the nature and assumptions of PDP (Jacoby, 1991(Jacoby, , 1998Yonelinas & Jacoby, 1994), this seems unlikely. This also again illustrates the utility of moving to a processing account of what changes after training on an implicit learning task. By focusing on the measurable differences in processing, we are no longer reliant on the introspective feelings of awareness as a primary index of how participants are performing the task.

Future directions
One question for future research is whether recognition tests of the type employed here, and generation tests used in other sequence learning studies (e.g., Destrebecqz & Cleeremans, 2001), are tapping the same underlying processes. In one sense this issue can be related to Shanks and St. Johns (1994) information criterion: whether information used within the test is tapping the information involved during the execution of the actual task. As shown with perceptual priming (e.g., Rajaram & Roediger, 1993), comparing different tests can help inform theory of the underlying processes or testing strategies by looking for similarities and differences in the tests leading to differences in performance. It may also offer insights into any strategic differences between participants on different forms of tests.
Further, while it may seem that previous experiments using generation and recognition tests for SRTT (e.g., Shanks & Johnstone, 1999) have contradictory findings, closer examination reveals that the results are in agreement. Shanks and Johnstone trained participants on similar sequences to the ones used in the current experiment, then administered a free-generation or recognition test. Their results for both recognition and free-generation test methods indicated that participants had some explicit knowledge of what had been learned. In the current experiment, we probed the broad category of explicit knowledge further by using the PDP to tease apart the processes supporting the apparent explicit knowledge at test. We chose to implement only a recognition test as it guaranteed that participants were being re-exposed to the trained sequences during test, thus forcing them to discriminate between them in their decisions.
Another issue for future research concerns the question whether the temporal discrimination used in this experiment led to an underestimation of the controlled processing estimate since there may exist control over the sequence knowledge itself independent of which half the sequence was occurred in. It is possible that this may be an infrequent occurrence (as concluded by Yonelinas & Jacoby, 1994), or could warrant further methods to allow for valid PDP comparisons.
The results of the correlations between transfer cost, self-reported awareness, and process estimates also suggest further experiments probing these relationships. It is possible that the two-alternative measure for self-reported awareness helped to inflate the correlation with transfer cost. Future experiments should employ a self-reported awareness measure with more responses to test if the correlation still holds with transfer costs.

Conclusions
Leveraging insights from implicit memory research can provide a foundation for progress on implicit learning. The PDP approach may help implicit learning research move away from arguing over the semantics of what is meant by implicit or explicit and back into the interesting nature of learning processes by providing a way to measure the likely underlying processes. This experiment demonstrated one possible method of using the process estimates to measure the relative contributions of automatic and controlled processes. Additionally, the process estimates were then used to then try to account for differences in training performance. Future improvements, such as non-temporal discriminations, are still possible with the reported technique. SRTT seems to rely on both automatic and controlled processing.

Footnotes
1 It is possible that participants will lack sufficient control to reliably discriminate the temporal half (Shanks & St. John, 1994). However, even if this is the case, since PDP allows for the calculation of relative use of the two processes, the process estimates will reflect the use of controlled processing if participants are able to discriminate temporal half at least some of the time.
2 This admittedly comes from a field other than sequence learning. However, they argued that their results should generalize to other forms of memory.
3 It is possible that the recognition PDP test will underestimate the controlled process estimate. However, the process estimates are relative estimates of the two process types, so if the controlled process estimate is underestimated the automatic process estimate will be overestimated.
But because the two process estimates are mathematically linked, it is incorrect to directly compare the two process estimates. To spoil the results, a controlled process estimate was found, so the concern that a false rejection of controlled processes occurring is unwarranted. 4 These proportions should not be confused with accuracy. These are the proportion of how often each item type was identified as "old".
The concept of a base error rate derived from accuracy is not as important for PDP as the participants' base rate of incorrectly identifying novel information as having been encountered before. If the false alarm rates differed between the inclusion and exclusion tests for these novel items, it would indicate that participants are using different strategies between tests. If this were true, it would be inappropriate to calculate process estimates.

Author note
The authors wish to thank the reviewers for their helpful comments in producing this document. This research is from a thesis submitted to the Academic Faculty of Colorado State University in partial fulfillment of the requirements for the degree of doctor of philosophy.